Chapter 6 



Fourier analysis 



(Historical intro: the heat equation on a square plate or interval.) 

Fourier's analysis was tremendously successful in the 19th century for for- 
mulating series expansions for solutions of some very simple ODE and PDE. 
This class shows that in the 20th century, Fourier analysis has established 
itself as a central tool for numerical computations as well, for vastly more 
general ODE and PDE when explicit formulas are not available. 

6.1 The Fourier transform 

We will take the Fourier transform of integrable functions of one variable 
xeR. 

Definition 13. (Integrability) A function f is called integrable, or absolutely 
integrable, when 



in the sense of Lebesgue integration. One also writes f & L} (R) for the space 
of integrable functions. 

We denote the physical variable as x, but it is sometimes denoted by 
t in contexts in which its role is time, and one wants to emphasize that. 
The frequency, or wavenumber variable is denoted k. Popular alternatives 
choices for the frequency variable are uj (engineers) or ^ (mathematicians), 
or p (physicists) . 
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Definition 14. The Fourier transform (FT) of an integrable function f{x) 
is defined as 



oo 

ikx 



f{k)= / e-''^f{x)dx. (6.1) 



oo 



When f{k) is also integrable, f{x) can he recovered from f{k) by means of 
the inverse Fourier transform (IFT) 



oo 



fix) = e''V» dk. (6.2) 

Intuitively, f{k) is the amplitude density of / at frequency k. The formula 
for recovering / is a decomposition of / into constituent waves. 

The justification of the inverse FT formula belongs in a real analysis class 
(where it is linked to the notion of approximate identity.) We will justify the 
form of (6.2) heuristically when we see Fourier series in the next section. 

The precaution of assuming integrability is so that the integrals can be 
understood in the usual Lebesgue sense. In that context, taking integrals 
over infinite intervals is perfectly fine. If (6.1) and (6.2) are understood as 
limits of integrals over finite intervals, it does not matter how the bounds are 
chosen to tend to ±oo. 

One may in fact understand the formulas for the FT and IFT for much 
larger function classes than the integrable functions, namely distributions, 
but this is also beyond the scope of the class. We will generally not overly 
worry about these issues. It is good to know where to draw the line: the 
basic case is that of integrable functions, and anything beyond that requires 
care and adequate generalizations. 

Do not be surprised to see alternative formulas for the Fourier transfom 
in other classes or other contexts. Wikipedia lists them. 

Here are some important properties of Fourier transforms: 

• (Differentiation) 

f'{k)=tkf{k). 

Justification: integration by parts in the integral for the FT. 
(Translation) If g{x) — f{x + a), then 

g{k) = e^'^fik). 
Justification: change of variables in the integral for the FT. 
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Let's see some examples of FT. 
Example 17. Let 

„, ,1 /-n/^ if X E [—a, a]; 

^^""^ ~ ^X[-a,a](a;j - j Q otherwise. 

Then 

1 r ^-ikx,^ _ sin(A)a) 



r/izs function is a scaled version of the sine function, 

sinc[k) = — - — . 

K 

It is easy to check by L 'Hospital's rule that 

sinc{0) = 1. 

At k ^ oo, sinc{k) decays like 1/k, but does so by alternating between positive 
and negative values. It is a good exercise to check that sine is not absolutely 
integrable. It turns out that the Fourier transform can still be defined for it, 
so lack of integrability is not a major worry. 

Example 18. Consider the Gaussian function 

fix) = e-^V2. 

By completing the square and adequately modifying the contour of integration 
in the complex plane (not part of the material for this class), it can be shown 
that 

Example 19. The Dirac delta 5{x) has a FT equal to 1 (why?). 

Another basic property of Fourier transforms is the convolution theorem. 

Theorem 11. (The convolution theorem.) Denote convolution as f -kg{x) — 
JZofiy)9{x -y)dy. Then 

f7g{k) = f{k)g{k)- 
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Proof. Let h — f ~k g. We can use Pubini below provided every function is 
integrable. 

h{k)^ j e-'^^ j f{y)g{x-y)dydx 

= j j e-'^yf{y)e-'^^''-y^^g{x-y) dydx 
^(^j e-^''yf{y)dy^ (^J e-''^' g{x') dx' 



f{k)m- 



□ 



The Fourier transform is an important tool in the study of linear differen- 
tial equations because it turns differential problems into algebraic problems. 
For instance, consider a polynomial P{x) — J2 ^n^^", and the ODE 

u{x) = f{x), X e 



dx 



which means X]^n|^ — f- (Such ODE are not terribly relevant in real life 
because they are posed over the whole real line.) Upon Fourier transforma- 
tion, the equation becomes 

P{ik)u{k)^f{k), 

which is simply solved as 

u(k) - M. 

and then 



1 



uix) = — / e'^'' 4rrTiT dk. 



Beware the zeros of P when applying this formula! They always carry im- 
portant physical interpretation. For instance, they could be resonances of a 
mechanical system. 

The formula u{k) = -pjj^ also lends itself to an application of the convo- 
lution theorem. Let K{x) be the inverse Fourier transform of 1/ P{ik). Then 
we have 



u{x) = J K{x - y)f{y)dy. 
The function K is called Green's function for the original ODE. 
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6.2 Sampling and restriction 

We aim to use Fourier transforms as a concept to help understand the ac- 
curacy of representing and manipulating functions on a grid, using a finite 
number of degrees of freedom. We also aim at using a properly discretized 
Fourier transform numerical tool itself. 

For this purpose, a; G M and k G M. must be replaced by x and k on finite 
grids. Full discretization consists of sampling and restriction. 

Let us start by sampling x G /iZ, i.e., considering Xj — jh for j G Z. The 
important consequence of sampling is that some complex exponential waves 
^tkx £qj, different k will appear to be the same on the grid Xj. We call aliases 
such functions that identify on the grid. 

Definition 15. (Aliases) The functions e*^ and e'^^a; q^j-^ aliases on the grid 
= jh if 

Aliases happen as soon as 

kijh — k2jh + 27r X integer (j). 
Letting j — 1, and calling the integer n, we have 

h 

for some n & Z. To wave numbers ki, k2 are indistinguishable on the grid if 
they differ by an integer multiple of 27i/h. 

For this reason, we restrict without loss of generality the wavenumber to 
the interval 

k G [-iT/h,7r/h]. 

We also call this interval the fundamental cell in frequency (in reference to 
a similar concept in crytallography.) 

Real-life examples of aliases are rotating wheels looking hke they go back- 
wards in a movie. Moire patterns on jackets on TV, and stroboscopy. 

The proper notion of Fourier transform on a grid is the following. 

Definition 16. LetXj — hj, fj — f{xj). Semidiscrete Fourier transform 
(SFT): 

oo 

/>) ^hY, e-''^^fj, k G [-n/h, n/h]. (6.3) 
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Inverse semidiscrete Fourier transform (ISFT): 



/, e^'^fik) dk. (6.4) 



As we saw, sampling in x corresponds to a restriction in k. If one still 
wanted to peek outside [—n/h, n/h] for the SFT, then the SFT would simply 
repeat by periodicity: 

(why?). That's why we restrict it to the fundamental cell. 

We can now define the proper notion of Fourier analysis for functions 
that are restricted to x in some interval, namely [— vr, tt] for convention. Un- 
surprisingly, the frequency is sampled as a result, the following formulas are 
dual to those for the SFT. 

Definition 17. Fourier series (FS): 

fk= r e-''''fix)dx. (6.5) 

J —TT 

Inverse Fourier series (IFS) 

^ oo 

/(^) = ^ E ^^[-^M- (6.6) 

fc=— oo 

If one uses the Fourier series inversion formula (6.6) for x outside of its 
intended interval [— 7r,7r], then the function simply repeats by periodicity: 

fix + 2711:)^ fix). 

(again, why?) 

The two formulas (6.5) and (6.6) can be justified quite intuitively. The 
expression J fix)gix) dx is an inner product on functions. It is easy to see 
that the complex exponentials 

form an orthonormal set of functions on [— 7r//i, 7r//i], for this inner product. 
Hence, up to normalization constants, (6.5) is simply calculation of the coef- 
ficients in an orthobasis (analysis), and (6.6) is the synthesis of the function 
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back from those coefficients. We'd fiave to understand more about tfie pecu- 
liarities of infinite-dimensional linear algebra to make this fully rigorous, but 
this is typically done in a real analysis class. 
Here's an example of SFT. 



Example 20. 



1 



X 



J J 



Then 



h 

_ _^ikah Q-ikjh 
j=0 

^ f^-ikh\2a+l _ ^ 

= — e**'"'' (geometric series) 

2a g-i/cft _ I 

_ h sin(fc/i(a + 1/2)) 

~ 2a sin(A;/i/2) ' 

This function is called the discrete sine. It looks like a sine, but it periodizes 
smoothly when k = —tt/H and k — ir/h are identified. 

Our first slogan is therefore: 

Sampling in x corresponds to restriction/periodization in k, and 
restriction/periodization in k corresponds to sampling in x. 



6.3 The DFT and its algorithm, the FFT 

The discrete Fourier transform is what is left of the Fourier transfom when 
both space and frequency are sampled and restricted to some interval. 
Consider 

Xj^jh, j^l,...,N. 

The point j = is identified with j — N hy periodicity, so it is not part of 
the grid. If the endpoints are xq — and xn — Stt, then N and h relate as 

27T TT N 
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For the dual grid in frequency, consider that points should be equispaced 
between the bounds [— 7r//i, 7r//i]. The resulting grid is 

AT 

k, k--- + l,...,-. 

We have the following definition. 
Definition 18. Discrete Fourier transform (DFT): 

N 

2 2 



h^hJ2e-''^'fj, k^-^,...,^. (6.7) 



Inverse discrete Fourier transform (IDFT) 

N/2 

27r 



N/2 

fj = ^ E i = (6-8) 

k=-N/2+l 

The DFT can be computed as is, by implementing the formula (6.7) di- 
rectly on a computer. The complexity of this calculation is a 0{N'^), since 
there are A^ values of j, and there are N values of k over which the compu- 
tation must be repeated. 

There is, however, a smart algorithm that allows to group the compu- 
tation of all the fk in complexity 0{N log N). It is called the fast Fourier 
transform (FFT). It is traditionally due to Tukey and Cooley (1965), but the 
algorithm had been discovered a few times before that by people who are not 
usually credited as much: Danielson and Lanczos in 1942, and well as Gauss 
in 1805. 

The trick of the FFT is to split the samples of / into even samples (j 
even) and odd samples {j odd). Assume that A^ is a power of 2. Then 

N 

fk^hY^e-'^^^f^ 

N/2 N/2 

^hJ2 e-''^'^^^f2j + hJ2 e-^'=('^'+^)^ /2,+i 

N/2 N/2 

j=i j=i 
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The first term on the last hne is simply the DFT of length N/2, on a 
grid of spacing 2h, of the even samples of /. The second term is, besides the 
multiplicative factor, the DFT of length N/2, on a grid of spacing 2h, of the 
odd samples of /. 

Note that those smaller DFTs would normally be only calculated for 

k — + 1, . . . , ^, but we need them for k = — ^ + 1, . . . , y. This is not 
a big problem: we know that the DFT extends by periodicity outside the 
standard bounds for k, so all there is to do is copy fk by periodicity outside 



Already, we can see the advantage in this reduction: solving one problem 
of size is essentially reduced to solving two problems of size N/2. Even 
better, the splitting into even and odd samples can be repeated recursively 
until the DFT are of size 1. When = 1, the DFT is simply multiplication 
by a scalar. 

At each stage, there are 0{N) operations to do to put together the sum- 
mands in the equation of the last line above. Since there are 0(log A") levels, 
the overall complexity is O(A^logA^). 

There are variants of the FFT when A^ is not a power of 2. 

6.4 Smoothness and truncation 

In this section, we study the accuracy of truncation of Fourier transforms 
to finite intervals. This is an important question not only because real- 
life numerical Fourier transforms are restricted in k, but also because, as 
we know, restriction in k serves as a proxy for sampling in x. It will be 
apparent in Chapter 2, section 2.1 that every claim that we make concerning 
truncation of Fourier transforms will have an implication in terms of accuracy 
of sampling a function on a grid, i.e., how much information is lost in the 
process of sampling a function f{x) at points Xj = jh. 

We will manipulate functions in the spaces L^, L^, and L°°. We have 
already encountered L^. 

Definition 19. Let I < p < oo. A function f of x & M. is said to belong to 
the space L^(M) when 



of = + 1,... 



N 
' 4 • 
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A function f of x is said to belong to L°° when 



ess sup < oo. 



Then the norm of f in L°°(R) is esssup 

In the definition above, "ess sup" refers to the essential supremum, i.e., 
the infimum over all dense sets X C R of the supremum of / over X. A 
set X is dense when 'M\X has measure zero. The notions of supremum and 
infimum correspond to maximum and minimum respectively, when they are 
not necessarily attained. All these concepts are covered in a real analysis 
class. For us, it suffices to heuristically understand the L°° norm as the 
maximum value of the modulus of the function, except possibly for isolated 
points of discontinuity which don't count in calculating the maximum. 

It is an interesting exercise to relate the L°° norm to the sequence of LP 
norms as p — > oo. 

We will need the very important Parseval and Plancherel identitites. They 
express "conservation of energy" from the physical domain to the frequency 
domain. 

Theorem 12. (Parseval's identity). Let f,g E (R) C] (R) . Then 



Proof. Let h be the convolution f -kg, where g{x) = g{—x). It is easy to see 
that the Fourier transform of g is g{k) (why?). By the convolution theorem 
(Section 1.1), we have 



If we integrate this relation over A; e M, and divide by 27r, we get the IFT at 




J-oo J-oo 



h{k) = f{k)g{k). 



x = 0: 




dk. 



On the other hand. 




□ 
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Theorem 13. (Plancherel's identity) . Let f E L^{R) n L'^(R). Then 

/OO 1 /'CxD 

jf{x)\'dx^-j jmi'dk. 

Proof. Apply Parseval's identity with g = f. □ 

(With the help of these formulas, it is in fact possible to extend their 
validity and the validity of the FT to f,g £ L^(R), and not simply f,g & 
L^(M.) n L^(]R). This is a classical density argument covered in many good 
analysis texts.) 

We need one more concept before we get to the study of truncation of 
Fourier transforms. It is the notion of total variation. We assume that the 
reader is familiar with the spaces C'^(R) of bounded functions which are k 
times continuously differentiable. 

Definition 20. (Total variation) Let f G C^(]R). The total variation of f is 
the quantity 

/oo 
\f'{x)\dx. (6.9) 
-oo 

For functions that are not , the notion of total variation is given by either 
expression 

WfWrv - Jim r dx = sup ^ \f{x,^^)-f{x,)\, 

J —oo I"'! {Xp} finite subset of K. ^ 

(6.10) 

These more general expressions reduce to \f'{x)\dx when f G C^(M). 
When a function has finite total variation, we say it is in the space of func- 
tions of bounded variation, or B l/(M) . 

The total variation of a piecewise constant function is simply the sum of 
the absolute value of the jumps it undergoes. This property translates to a 
useful intuition about the total variation of more general functions if we view 
them as limits of piecewise constant functions. 

The important meta-property of the Fourier transform is that 

decay for large \k\ corresponds to smoothness in x. 
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There are various degrees to which a function can be smooth or rates at 
which it can decay, so therefore there are several ways that this assertion 
can be made precise. Let us go over a few of them. Each assertion either 
expresses a decay (in k) to smoothness (in x) imphcation, or the converse 
imphcation. 

• Let f e L^(M) (decay), then f e L°°(M) and f is continuous (smooth- 
ness). That's because \e^'^^\ = 1, so 

1/(^)1 \e''^m\dk=^ J \f{k)\dk, 

which proves boundedness. As for continuity, consider a sequence y„ — > 
and the formula 



/(^ - = ^ / dk. 



The integrand converges pointwise to e''^^/(/c), and is uniformly bounded 
in modulus by the integrable function |/(A;)|. Hence Lebesgue's dom- 
inated convergence theorem applies and yields f{x — y„) —>■ f{x), i.e., 
continuity in x. 

Let f{k){l + \k\P) e L\R) (decay). Then f e (smoothness). We 
saw the case p = above; the justification is analogous in the general 
case. We write 

1/"^^)! \e''^{ikrm\dk< j \k\^\f{k)\dk, 

which needs to be bounded for all < n < p. This is obviously the 
case if f{k){l + \k\P) e Li(M). Continuity of is proved hke before. 

Let f e 5F(M) (smoothness). Then f{k) < \\f\\Tv 1^1"^ (decay). If 
f e f] BV{M), then this is justified very simply from (6.9), and 

ikf{k) = J e-'^''f{x) dx. 

Take a modulus on both sides, and get the desired relation 

\k\\fm< j \nx)\dx = \\f\\Tv<^. 
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When / G BV{W), but f ^ C^, either of the more general formulas 
(6.10) must be used instead. It is a great practice exercise to articulate 
a modified proof using the lim/j^o formula, and properly pass to the 
limit. 

• Let f such that f^^^ E L^(]R) for < k < p, and assume that G 
i?F(M) (smoothness). Then there exists C > such that \f{k)\ < 
1^1 (decay). This claim is also formulated as point (a) in Theo- 
rem 1 on page 30 of Trefethen's "Spectral methods in Matlab" . The 
justification is very simple when / G C^'^^: we then get 

so 

\kr'\f{k)\< J \fP+'\x)\dx=\\fP^\\Tv<oc. 

Again, it is a good exercise to try and extend this result to functions 
not in CP+\ 

• (This is the one we'll use later). (Same proof as above.) 

In summary, let / have p derivatives in L^. Then \f{k)\ < C\k\~P. This 
is the form we'll make the most use of in what follows. 

Example 21. The canonical illustrative example for the two statements in- 
volving bounded variation is that of the B-splines. Consider 

«(^) = ^X[-i,i](^)' 

the rectangle- shaped indicator of [—1, 1] (times one half). It is a function in 
BV{E.), hut it has no derivative in L^(]R). Accordingly, its Fourier transform 
s{k) is predicted to decay at a rate ~ \k\~^ . This is precisely the case, for we 
know 



Functions with higher regularity can he ohtained hy auto- convolution of s; for 
instance S2 = s -k s is a triangle-shaped function which has one derivative in 
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L"^, and such that s'^ G BV{M.). We anticipate that S2{k) would decay like 
and this the case since by the convolution theorem 

Any number of autoconvolutions s-ks . . .-ks can thus be considered: that's the 
family of B-splines. 

The parallel between smoothness in x and decay in k goes much further. 
We have shown that p derivatives in x very roughly corresponds to an inverse- 
polynomial decay |A;|~^ in A;. So C°° functions in x have so called super- 
algol)iaic decay in k\ faster than Cp|/c|~^ for all p > 0. 

The gradation of very smooth functions goes beyond C°° functions to 
include, in order: 

• analytic functions that can be extended to a strip in the complex plane 
(like f{x) = 1/(1 + a;^)), corresponding to a Fourier transform decaying 
exponentially (in this case f{k) = Tre"''^'). That's Paley- Wiener theory, 
the specifics of which is not material for this course. 

• analytic functions that can be extended to the whole complex plane 
with super-exponential growth (like f{x) = e~^ /^), whose Fourier 
transform decays faster than exponentially (in this case f(k) = y^Tr/Se"^ /^). 

• analytic functions that can be extended to the whole complex plane 
with exponential growth at infinity (like f{x) = ^^), whose Fourier 
transform is compactly supported (in this case f{k) = 27rx[_i,i](A;)). 
That's Paley- Wiener theory in reverse. Such functions are also called 
bandlimited. 

More on this in Chapter 4 of Trefethen's book. 

An important consequence of a Fourier transform having fast decay is 
that it can be truncated to some large interval k G [—N, N] at the expense 
of an error that decays fast as a function of A^. The smoother /, the faster 
/ decays, and the more accurate the truncation to large N in frequency. On 
the other hand, there may be convergence issues if f{x) is not smooth. 

To make this quantitative, put 

fN{k) = X[-N,N]{k)f{k). 
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Recall that ^ e^'^^dx — ■ By the convolution theorem, we therefore 

have . ^ 

fN{x) = -><f{x). 

TTX 

In the presence of / e iv^(M), letting — > oo always gives rise to conver- 
gence /jv — > / in L^(]R). This is because by Plancherel's identity, 

ii/-/ivii2 = ^ r \fN{k)-m\'dk = ^ [ \m\'dk. 

^TT y_oo 27r J\k\>N 

This quantity tends to zero as — > oo since the integral over the 

whole line is bounded. 

The story is quite different if we measure convergence in L°° instead of 
L^, when / has a discontinuity. Generically, L°° (called uniform) convergence 
fails in this setting. This phenomenon is called Gibb's ejfect, and manifests 
itself through ripples in reconstructions from truncated Fourier transforms. 
Let us illustrate this on the Heaviside step function 



U(X} 



1 if a; > 0; 
if X < 0. 



(It is not a function in L^(R) but this is of no consequence to the argument. 
It could be modified into a function in L^(M) by some adequate windowing.) 
Since u{x) is discontinuous, we expect the Fourier transform to decay quite 
slowly. Consider the truncated FT 

UN{k) = X[-N,N]{k)u{k). 

Back in the x domain, this is 

, , sinNx , , 

'>J'N{x) = i<:U[X) 

TTX 

"OO 



sin N{x — y) 

7i{x 
siny 



7r{x - y) 

Nx 

Nl I I 7/ 

dy 



dy 



oo Try 
s(Nx). 



(Draw picture) 
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The function s{Nx) is a sigmoid function going from at — oo to 1 at cxo, 
going through s(0) = 1/2, and taking on min. value ~ —.045 at a; = —tt/N 
and max. value ~ 1.045 at x = tt/N. The oscillations have near-period tt/N. 
The parameter N is just a dilation index for the sigmoid, it does not change 
the min. and max. values of the function. Even as A?" — > oo, there is no way 
that the sigmoid would converge to the Heaviside step in a uniform manner: 
we always have 

\\u - unWoo > -045. 

This example of Gibb's effect goes to show that truncating Fourier expan- 
sions could be a poor numerical approximation if the function is nonsmooth. 

However, if the function is smooth, then everything goes well. Let us 
study the decay of the approximation error 

for truncation of the FT to [—N, N]. Again, we use Plancherel to get 

J\k\>N 

Now assume that f{k) < C\k\~P, a scenario already considered earlier. Then 

€%<C [ \k\-^P < C N-^P+\ 

J\k\>N 

SO, for some constant C" (dependent on p but independent of A^, hence called 
constant) , 

eN<C"N-P+'/''. 

The larger p, the faster the decay as — > oo. 

When a function is C°°, the above decay rate of the approximation error 
is valid for all p > 0. When the function is analytic and the Fourier trans- 
form decays exponentially, the decay rate of the approximation error is even 
faster, itself exponential (a good exercise). In numerical analysis, either such 
behavior is called spectral accuracy. 

6.5 Chebyshev expansions 

Chebyshev expansions, interpolation, differen- 
tiation, and quadrature rules are not part of 
the material for 18.330 in 2012. 



16 



6.5. CHEBYSHEV EXPANSIONS 



In the previous sections, we have seen that smooth functions on the real 
hue have fast decaying Fourier transforms. 

On a finite interval, a very similar property hold for Fourier series: if a 
function is smooth in [—71, tt] and connects smoothly by periodicity at a; = —n 
and X — then its Fourier series (6.5) decays fast. Periodicity is essential, 
because the points x — —tt and x — tt play no particular role in (6.5). They 
might as well be replaced by a; = and x = 271 for the integration bounds. 
So if a function is to qualify as smooth, it has to be equally smooth at x — 
as it is at a; = — tt, identified with a; = tt by periodicity. 

For instance if a function / is smooth inside [— tt, tt], but has /(— tt) ^ 
fiir), then for the purpose of convergence of Fourier series, / is considered 
discontinuous. We know what happens in this case: the Gibbs effect takes 
place, partial inverse Fourier series have unwelcome ripples, and convergence 
does not occur in L°°. 

If / is smooth and periodic, then it is a good exercise to generalize the 
convergence results of the previous section, from the Fourier transform to 
Fourier series. 

How shall we handle smooth functions in intervals [a,b], which do not 
connect smoothly by periodicity? The answer is not unique, but the most 
standard tool for this in numerical analysis are the Chebyshev polynomials. 

For simplicity consider x G [—1, 1], otherwise rescale the problem. Take 
a C'^ nonperiodic f{x) in [—1, 1]. The trick is to view it as g{9) = f{cos9). 
Since cos[0, tt] = cos[7r, 27r] = [—1, 1], all the values of x e [—1, 1] are covered 
twice by ^ e [0, 27i]. Obviously, at any point 9, g inherits the smoothness of 
/ by the chain rule. Furthermore, g is periodic since cos 9 is periodic. So g is 
exactly the kind of function which we expect should have a fast converging 
Fourier series: 



/ e-'^^'g{9) d9, g{9) = ^ ^ ^ ^ ^ 

^ k=—oo 



Since g{9) is even in 9, we may drop the sin^ terms in the expansion, as well 
as the negative k: 



gk = / cos {k9)g{9) d9, g{9) = — + - V cos {k9)gk, k e Z+. 
Jo 27r TT ^ 
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CHAPTER 6. FOURIER ANALYSIS 



Back to /, we find 



/ cos (/carccosa:)/(a;) . =, f{x) — 7^go'\ — cos (/c arccosa;)^^, 

J-l Vl-X^ ZTT 



The function cos(A; arccosx) happens to be a polynomial in x, of order 
k, called the Chebyshev polynomial of order k. Switch to the letter n as is 
usually done: 

Tn{x) = cos(narccosa;), x G [—1, 1]. 
The first few Chebyshev polynomials are 

To{x) = 1, Ti{x) = X, T2{x) = - 1. 

It is a good exercise (involving trigonometric identities) to show that they 
obey the recurrence relation 

Tn+l{x) = 2xTn{x) - Tn-l{x). 

The orthogonality properties of T„(x) over [—1,1] follows from those of 
cos(n^) over [0,7r], but watch the special integration weight: 

-1 Vl-x^ 

with c„ = 7r/2ifn = and c„ = tt otherwise. The Chebyshev expansion of 
/ is therefore 

T4x)f{x) -j=^, fix) = -(/,To) + - 5]T,(a;)(/,T„), k G Z+. 



Under the hood, this expansion is the FS of a periodic function, so we can 
apply results pertaining to Fourier series and Fourier transforms to obtain 
fast decay of Chebyshev expansions. This way, spectral accuracy is restored 
for nonperiodic functions defined on an interval. 
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